Overview

Dataset info

Number of variables59
Number of observations109200
Missing cells1007073 (15.6%)
Duplicate rows0 (0.0%)
Total size in memory190.6 MiB
Average record size in memory1.8 KiB

Variables types

CAT35
NUM15
BOOL8
DATE1

Reproduction info

Date of analysis2020-04-27 11:12:49.024998
Versionpandas-profiling v2.4.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download Configurationconfig.yaml

Warnings

Alternative_purchasing_group has 4291 (3.9%) missing values Missing
Alternative_purchasing_group has a high cardinality: 566 distinct values Warning
ATC has a high cardinality: 1124 distinct values Warning
Chemical_/_biological has 9190 (8.4%) missing values Missing
consumption has 8749 (8.0%) zeros Zeros
consumption_previous_year has 4354 (4.0%) zeros Zeros
Current_consumption has 9906 (9.1%) zeros Zeros
Dangerous_material_is_charged_with_a_permit has 96190 (88.1%) missing values Missing
dangerous_substance has 96189 (88.1%) missing values Missing
Date_of_basket_entry has 49905 (45.7%) missing values Missing
Date_of_basket_entry has a high cardinality: 55 distinct values Warning
Description_Alternative_purchasing_group has 4291 (3.9%) missing values Missing
Description_Alternative_purchasing_group has a high cardinality: 566 distinct values Warning
Description_outline has 8322 (7.6%) missing values Missing
Description_outline has a high cardinality: 307 distinct values Warning
European_patent_expires has 107645 (98.6%) missing values Missing
For_adults has 104554 (95.7%) missing values Missing
Form_of_giving has 64017 (58.6%) missing values Missing
GENERY_FATHER has 64846 (59.4%) missing values Missing
GENERY_FATHER has a high cardinality: 686 distinct values Warning
Inventory has 12220 (11.2%) zeros Zeros
Inventory_of_consumption_months is highly skewed (γ1 = 84.76938652) Skewed
Inventory_of_consumption_months has 12700 (11.6%) zeros Zeros
Main_outline has 8322 (7.6%) missing values Missing
Main_outline has a high cardinality: 307 distinct values Warning
Narcotic_/_psychotropic has 49980 (45.8%) missing values Missing
Narcotic_/_psychotropic has 55580 (50.9%) zeros Zeros
Plant has constant value "7350.0" Rejected
Prediction has 10520 (9.6%) zeros Zeros
PRICE is highly skewed (γ1 = 24.26847489) Skewed
Price_for_absolute_packaging has 1632 (1.5%) zeros Zeros
Quantity_in_absolute_packaging has 1631 (1.5%) zeros Zeros
Quantity_in_Packaging-Absolute has 1467 (1.3%) missing values Missing
Quantity_in_packing-relative has 3430 (3.1%) missing values Missing
Quantity_in_packing-relative has 73588 (67.4%) zeros Zeros
Safety_Stock has 11178 (10.2%) zeros Zeros
Send_code_to_Omri has constant value "1.0" Rejected
Serving_form has 9531 (8.7%) missing values Missing
Serving_form has a high cardinality: 53 distinct values Warning
skucode2 has a high cardinality: 3281 distinct values Warning
status has constant value "1.0" Rejected
Toxic_item has 8884 (8.1%) missing values Missing
type_of_packeging has 102708 (94.1%) missing values Missing
U#S#_Patent_Expires has 107330 (98.3%) missing values Missing
Validity_of_Ministry_of_Health_registration has 103787 (95.0%) missing values Missing
Validity_of_Ministry_of_Health_registration has a high cardinality: 83 distinct values Warning
VENDOR has a high cardinality: 155 distinct values Warning
consumption_previous_year is highly correlated with consumption and 2 other fieldsHigh Correlation
consumption is highly correlated with consumption_previous_year and 2 other fieldsHigh Correlation
Current_consumption is highly correlated with consumption and 2 other fieldsHigh Correlation
Prediction is highly correlated with consumption and 2 other fieldsHigh Correlation
Quantity_in_Packaging-Absolute is highly correlated with Quantity_in_absolute_packagingHigh Correlation
Quantity_in_absolute_packaging is highly correlated with Quantity_in_Packaging-AbsoluteHigh Correlation
Safety_Stock is highly correlated with InventoryHigh Correlation
Inventory is highly correlated with Safety_StockHigh Correlation

Variables

Distinct count38
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
20190
17706
20100
13499
20120
11289
20130
 
9307
20210
 
7186
Other values (33)
50213
ValueCountFrequency (%) 
20190 17706 16.2%
 
20100 13499 12.4%
 
20120 11289 10.3%
 
20130 9307 8.5%
 
20210 7186 6.6%
 
20140 6236 5.7%
 
20270 5181 4.7%
 
20220 4568 4.2%
 
20180 4468 4.1%
 
20110 3636 3.3%
 
Other values (28) 26124 23.9%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length5
Mean length4.995924908
Min length4
Scatter

ABC
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
C
59649
B
36448
A
13103
ValueCountFrequency (%) 
C 59649 54.6%
 
B 36448 33.4%
 
A 13103 12.0%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length1
Mean length1
Min length1
Scatter
Distinct count39
Unique (%)< 0.1%
Missing187
Missing (%)0.2%
Memory size853.2 KiB
20
17446
11
13890
13
11552
24
10966
14
9608
Other values (33)
45551
ValueCountFrequency (%) 
20 17446 16.0%
 
11 13890 12.7%
 
13 11552 10.6%
 
24 10966 10.0%
 
14 9608 8.8%
 
21 7346 6.7%
 
15 6314 5.8%
 
22 4891 4.5%
 
19 4767 4.4%
 
17 3448 3.2%
 
Other values (28) 18785 17.2%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length2.003424908
Min length2
Scatter

Alternative_purchasing_group
Categorical

MISSING
HIGH CARDINALITY
Distinct count566
Unique (%)0.5%
Missing4291
Missing (%)3.9%
Memory size853.2 KiB
1734
 
3548
1591
 
2260
1875
 
2037
1068
 
1817
1823
 
1622
Other values (560)
93625
ValueCountFrequency (%) 
1734 3548 3.2%
 
1591 2260 2.1%
 
1875 2037 1.9%
 
1068 1817 1.7%
 
1823 1622 1.5%
 
1317 1570 1.4%
 
1053 1466 1.3%
 
1642 1231 1.1%
 
1027 1161 1.1%
 
1354 1130 1.0%
 
Other values (555) 87067 79.7%
 
(Missing) 4291 3.9%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length4
Min length4
Scatter

ATC
Categorical

HIGH CARDINALITY
Distinct count1124
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
1/0/1900
 
8221
ATC:
 
1317
ATC:M01AE01
 
1068
ATC:N03AX16
 
763
ATC:C10AA07
 
745
Other values (1119)
97086
ValueCountFrequency (%) 
1/0/1900 8221 7.5%
 
ATC: 1317 1.2%
 
ATC:M01AE01 1068 1.0%
 
ATC:N03AX16 763 0.7%
 
ATC:C10AA07 745 0.7%
 
ATC:A11AA06 731 0.7%
 
ATC:C10AA05 731 0.7%
 
ATC:N06BA04 711 0.7%
 
ATC:A11CC05 659 0.6%
 
ATC:A07FA01 639 0.6%
 
Other values (1114) 93615 85.7%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length11
Mean length10.52885531
Min length4
Scatter
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
EA
91354
PAC
 
17403
KIT
 
386
BLS
 
29
BT
 
27
ValueCountFrequency (%) 
EA 91354 83.7%
 
PAC 17403 15.9%
 
KIT 386 0.4%
 
BLS 29 < 0.1%
 
BT 27 < 0.1%
 
KG 1 < 0.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length3
Mean length2.163168498
Min length2
Scatter

Budget_Group
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
B
51894
E
27954
A
22502
D
 
6715
H
 
135
ValueCountFrequency (%) 
B 51894 47.5%
 
E 27954 25.6%
 
A 22502 20.6%
 
D 6715 6.1%
 
H 135 0.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length1
Mean length1
Min length1
Scatter

Chemical_/_biological
Categorical

MISSING
Distinct count3
Unique (%)< 0.1%
Missing9190
Missing (%)8.4%
Memory size853.2 KiB
1
96404
2
 
3606
ValueCountFrequency (%) 
1 96404 88.3%
 
2 3606 3.3%
 
(Missing) 9190 8.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

chronic
Boolean

Distinct count3
Unique (%)< 0.1%
Missing560
Missing (%)0.5%
Memory size853.2 KiB
0
61792
1
46848
(Missing)
 
560
ValueCountFrequency (%) 
0 61792 56.6%
 
1 46848 42.9%
 
(Missing) 560 0.5%
 
Distinct count3
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Memory size853.2 KiB
ave
90097
pred
19102
ValueCountFrequency (%) 
ave 90097 82.5%
 
pred 19102 17.5%
 
(Missing) 1 < 0.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.174935897
Min length3
Scatter

consumption
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2259
Unique (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean77895.93857
Minimum0.0
Maximum4392300.0
Zeros8749
Zeros (%)8.0%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1836
median4720
Q347377.5
95-th percentile376105
Maximum4392300
Range4392300
Interquartile range (IQR)46541.5

Descriptive statistics

Standard deviation244982.8421
Coefficient of variation (CV)3.14500148
Kurtosis107.3862214
Mean77895.93857
Median Absolute Deviation (MAD)107843.0769
Skewness8.442353745
Sum8506236492
Variance6.001659294e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 4.0000000e+00 6.0000000e+00 1.4500000e+01 ... 1.1802800e+06 1.2108105e+06 1.3336665e+06 1.6117350e+06 4.3923000e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 8749 8.0%
 
1025 163 0.1%
 
1097 151 0.1%
 
2427 145 0.1%
 
384 144 0.1%
 
678 144 0.1%
 
27290 144 0.1%
 
5688 141 0.1%
 
136200 136 0.1%
 
1177 133 0.1%
 
Other values (2249) 99150 90.8%
 
ValueCountFrequency (%) 
0 8749 8.0%
 
1 32 < 0.1%
 
2 20 < 0.1%
 
3 21 < 0.1%
 
5 2 < 0.1%
 
ValueCountFrequency (%) 
4392300 72 0.1%
 
3318000 72 0.1%
 
3239610 72 0.1%
 
1633120 72 0.1%
 
1590350 72 0.1%
 

consumption_previous_year
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2806
Unique (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean885070.1067
Minimum0.0
Maximum46044870.0
Zeros4354
Zeros (%)4.0%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile844.45
Q111172
median61332
Q3583110
95-th percentile4533600
Maximum46044870
Range46044870
Interquartile range (IQR)571938

Descriptive statistics

Standard deviation2696712.997
Coefficient of variation (CV)3.046891966
Kurtosis98.64178706
Mean885070.1067
Median Absolute Deviation (MAD)1205272.079
Skewness8.117374417
Sum9.664965565e+10
Variance7.27226099e+12
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 2.0000000e+00 6.5000000e+00 1.1500000e+01 ... 1.5478775e+07 1.6305360e+07 1.8004425e+07 3.6691470e+07 4.6044870e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4354 4.0%
 
12687 128 0.1%
 
9622 110 0.1%
 
11371 105 0.1%
 
17716 103 0.1%
 
5770 101 0.1%
 
4532 98 0.1%
 
454770 92 0.1%
 
3530 85 0.1%
 
35771040 72 0.1%
 
Other values (2796) 103952 95.2%
 
ValueCountFrequency (%) 
0 4354 4.0%
 
1 67 0.1%
 
3 4 < 0.1%
 
10 58 0.1%
 
13 2 < 0.1%
 
ValueCountFrequency (%) 
46044870 72 0.1%
 
37611900 72 0.1%
 
35771040 72 0.1%
 
19037280 72 0.1%
 
16971570 72 0.1%
 

Current_consumption
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2243
Unique (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103728.2482
Minimum0.0
Maximum5666490.0
Zeros9906
Zeros (%)9.1%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1982
median5879
Q362790
95-th percentile520530
Maximum5666490
Range5666490
Interquartile range (IQR)61808

Descriptive statistics

Standard deviation321749.018
Coefficient of variation (CV)3.101845674
Kurtosis101.8774158
Mean103728.2482
Median Absolute Deviation (MAD)143777.058
Skewness8.188974489
Sum1.13271247e+10
Variance1.035224306e+11
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000000e+00 5.000000e-01 1.500000e+00 2.500000e+00 5.500000e+00 ... 1.677765e+06 1.707915e+06 1.974510e+06 4.293390e+06 5.666490e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 9906 9.1%
 
1023 182 0.2%
 
3355 182 0.2%
 
602 177 0.2%
 
120 175 0.2%
 
180 162 0.1%
 
763 157 0.1%
 
722 143 0.1%
 
1401 142 0.1%
 
3814 141 0.1%
 
Other values (2233) 97833 89.6%
 
ValueCountFrequency (%) 
0 9906 9.1%
 
1 1 < 0.1%
 
2 93 0.1%
 
3 32 < 0.1%
 
4 13 < 0.1%
 
ValueCountFrequency (%) 
5666490 72 0.1%
 
4368000 72 0.1%
 
4218780 72 0.1%
 
2010420 72 0.1%
 
1938600 72 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing96190
Missing (%)88.1%
Memory size853.2 KiB
0
12611
1
 
399
ValueCountFrequency (%) 
0 12611 11.5%
 
1 399 0.4%
 
(Missing) 96190 88.1%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.642582418
Min length1
Scatter

dangerous_substance
Categorical

MISSING
Distinct count3
Unique (%)< 0.1%
Missing96189
Missing (%)88.1%
Memory size853.2 KiB
0
12242
1
 
769
ValueCountFrequency (%) 
0 12242 11.2%
 
1 769 0.7%
 
(Missing) 96189 88.1%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.642554945
Min length1
Scatter

Date_of_basket_entry
Categorical

MISSING
HIGH CARDINALITY
Distinct count55
Unique (%)0.1%
Missing49905
Missing (%)45.7%
Memory size853.2 KiB
01.01.1995
28502
01.01.2000
 
5139
01.03.2001
 
3979
01.03.1999
 
3303
01.03.2002
 
2525
Other values (49)
15847
ValueCountFrequency (%) 
01.01.1995 28502 26.1%
 
01.01.2000 5139 4.7%
 
01.03.2001 3979 3.6%
 
01.03.1999 3303 3.0%
 
01.03.2002 2525 2.3%
 
01.01.2009 1691 1.5%
 
01.05.2006 1669 1.5%
 
01.03.2008 1272 1.2%
 
15.01.2015 1016 0.9%
 
01.01.2010 963 0.9%
 
Other values (44) 9236 8.5%
 
(Missing) 49905 45.7%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length7.257967033
Min length4
Scatter

Description_Alternative_purchasing_group
Categorical

MISSING
HIGH CARDINALITY
Distinct count566
Unique (%)0.5%
Missing4291
Missing (%)3.9%
Memory size853.2 KiB
Hypertension
 
3548
Bact.Inf., Sys.
 
2260
Statines
 
2037
EPILEPSY- Anticonvulsants
 
1817
Pain, Mod/Sev
 
1622
Other values (560)
93625
ValueCountFrequency (%) 
Hypertension 3548 3.2%
 
Bact.Inf., Sys. 2260 2.1%
 
Statines 2037 1.9%
 
EPILEPSY- Anticonvulsants 1817 1.7%
 
Pain, Mod/Sev 1622 1.5%
 
הורדת חום ושיכוך כאב 1570 1.4%
 
Depression-SSRI 1466 1.3%
 
Diabetes Mell. 1231 1.1%
 
BPH 1161 1.1%
 
טיפול אנטי פטרייתי 1130 1.0%
 
Other values (555) 87067 79.7%
 
(Missing) 4291 3.9%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length30
Mean length15.21119963
Min length2
Scatter

Description_outline
Categorical

MISSING
HIGH CARDINALITY
Distinct count307
Unique (%)0.3%
Missing8322
Missing (%)7.6%
Memory size853.2 KiB
Hypertension
 
3810
Diabetes Mell.
 
3613
food adittive
 
3037
Not Designated
 
2813
Epilepsy
 
2436
Other values (301)
85169
ValueCountFrequency (%) 
Hypertension 3810 3.5%
 
Diabetes Mell. 3613 3.3%
 
food adittive 3037 2.8%
 
Not Designated 2813 2.6%
 
Epilepsy 2436 2.2%
 
cosmetics 2290 2.1%
 
Bact.Inf., Sys. 2264 2.1%
 
Statines 2037 1.9%
 
Antidepressants 2008 1.8%
 
Pain, Mild/Mod. 1927 1.8%
 
Other values (296) 74643 68.4%
 
(Missing) 8322 7.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length30
Mean length11.98589744
Min length3
Scatter
Distinct count902
Unique (%)0.8%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
Minimum2006-03-06 00:00:00
Maximum2019-11-20 00:00:00
Mini histogram
Histogram
Histogram

European_patent_expires
Categorical

MISSING
Distinct count31
Unique (%)< 0.1%
Missing107645
Missing (%)98.6%
Memory size853.2 KiB
01.02.2015
199
01.06.2017
 
121
01.03.2016
 
120
01.10.2018
 
114
01.03.2017
 
98
Other values (25)
903
ValueCountFrequency (%) 
01.02.2015 199 0.2%
 
01.06.2017 121 0.1%
 
01.03.2016 120 0.1%
 
01.10.2018 114 0.1%
 
01.03.2017 98 0.1%
 
01.09.2019 78 0.1%
 
01.12.2022 74 0.1%
 
01.05.2020 72 0.1%
 
01.07.2018 72 0.1%
 
01.05.2019 64 0.1%
 
Other values (20) 543 0.5%
 
(Missing) 107645 98.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length4.08543956
Min length4
Scatter
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
0
97575
1
 
11625
ValueCountFrequency (%) 
0 97575 89.4%
 
1 11625 10.6%
 

For_adults
Categorical

MISSING
Distinct count2
Unique (%)< 0.1%
Missing104554
Missing (%)95.7%
Memory size853.2 KiB
1
4646
ValueCountFrequency (%) 
1 4646 4.3%
 
(Missing) 104554 95.7%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.872362637
Min length1
Scatter

Form_of_giving
Categorical

MISSING
Distinct count4
Unique (%)< 0.1%
Missing64017
Missing (%)58.6%
Memory size853.2 KiB
0
45152
17
 
25
15
 
6
ValueCountFrequency (%) 
0 45152 41.3%
 
17 25 < 0.1%
 
15 6 < 0.1%
 
(Missing) 64017 58.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length2.758992674
Min length1
Scatter

General_purpose
Categorical

Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
0
107484
6
 
1013
2
 
457
1
 
246
ValueCountFrequency (%) 
0 107484 98.4%
 
6 1013 0.9%
 
2 457 0.4%
 
1 246 0.2%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

GENERY_FATHER
Categorical

MISSING
HIGH CARDINALITY
Distinct count686
Unique (%)0.6%
Missing64846
Missing (%)59.4%
Memory size853.2 KiB
3999001541
 
220
3999001291
 
215
3999019172
 
211
3999001511
 
209
3999001491
 
208
Other values (680)
43291
ValueCountFrequency (%) 
3999001541 220 0.2%
 
3999001291 215 0.2%
 
3999019172 211 0.2%
 
3999001511 209 0.2%
 
3999001491 208 0.2%
 
3999001150 206 0.2%
 
3999001490 205 0.2%
 
3999003522 203 0.2%
 
3999001280 200 0.2%
 
3999001542 199 0.2%
 
Other values (675) 42278 38.7%
 
(Missing) 64846 59.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length10
Mean length6.437032967
Min length4
Scatter

Inventory
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2289
Unique (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103812.7952
Minimum0.0
Maximum6462780.0
Zeros12220
Zeros (%)11.2%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q11261
median5710
Q365750
95-th percentile484350
Maximum6462780
Range6462780
Interquartile range (IQR)64489

Descriptive statistics

Standard deviation361671.6745
Coefficient of variation (CV)3.483883406
Kurtosis134.8190112
Mean103812.7952
Median Absolute Deviation (MAD)144291.345
Skewness9.639427544
Sum1.133635723e+10
Variance1.308064001e+11
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000000e+00 1.000000e+00 3.500000e+00 1.150000e+01 2.900000e+01 ... 2.462100e+06 2.564640e+06 3.135090e+06 6.400395e+06 6.462780e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 12220 11.2%
 
875 156 0.1%
 
6647 144 0.1%
 
400 144 0.1%
 
5258 144 0.1%
 
7306 143 0.1%
 
41412 140 0.1%
 
2208 139 0.1%
 
74370 137 0.1%
 
88620 136 0.1%
 
Other values (2279) 95697 87.6%
 
ValueCountFrequency (%) 
0 12220 11.2%
 
2 11 < 0.1%
 
5 71 0.1%
 
6 40 < 0.1%
 
7 13 < 0.1%
 
ValueCountFrequency (%) 
6462780 72 0.1%
 
6338010 72 0.1%
 
3606000 72 0.1%
 
2664180 72 0.1%
 
2465100 72 0.1%
 

Inventory_of_consumption_months
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count530
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.764411447
Minimum0.0
Maximum786.67
Zeros12700
Zeros (%)11.6%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10.75
median1.36
Q32.19
95-th percentile3.63
Maximum786.67
Range786.67
Interquartile range (IQR)1.44

Descriptive statistics

Standard deviation7.255484096
Coefficient of variation (CV)4.112127083
Kurtosis8850.945007
Mean1.764411447
Median Absolute Deviation (MAD)1.155885864
Skewness84.76938652
Sum192673.73
Variance52.64204947
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.00000e+00 5.00000e-03 2.50000e-02 4.00000e-02 8.00000e-02 ... 2.56650e+01 3.23100e+01 1.27780e+02 2.00445e+02 7.86670e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 12700 11.6%
 
0.73 973 0.9%
 
0.93 882 0.8%
 
0.75 808 0.7%
 
0.9 754 0.7%
 
1.53 753 0.7%
 
0.69 735 0.7%
 
0.83 734 0.7%
 
1.07 729 0.7%
 
1.11 692 0.6%
 
Other values (520) 89440 81.9%
 
ValueCountFrequency (%) 
0 12700 11.6%
 
0.01 101 0.1%
 
0.02 98 0.1%
 
0.03 25 < 0.1%
 
0.05 63 0.1%
 
ValueCountFrequency (%) 
786.67 7 < 0.1%
 
255.13 2 < 0.1%
 
145.76 21 < 0.1%
 
109.8 16 < 0.1%
 
98.52 24 < 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing190
Missing (%)0.2%
Memory size853.2 KiB
1
67552
0
41458
(Missing)
 
190
ValueCountFrequency (%) 
1 67552 61.9%
 
0 41458 38.0%
 
(Missing) 190 0.2%
 

Item_Type
Categorical

Distinct count5
Unique (%)< 0.1%
Missing483
Missing (%)0.4%
Memory size853.2 KiB
0
108262
1
 
309
6
 
74
3
 
72
ValueCountFrequency (%) 
0 108262 99.1%
 
1 309 0.3%
 
6 74 0.1%
 
3 72 0.1%
 
(Missing) 483 0.4%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

Loading_group
Categorical

Distinct count14
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
23
94811
11
 
4007
25
 
3439
13
 
2058
24
 
1450
Other values (9)
 
3435
ValueCountFrequency (%) 
23 94811 86.8%
 
11 4007 3.7%
 
25 3439 3.1%
 
13 2058 1.9%
 
24 1450 1.3%
 
12 1002 0.9%
 
22 908 0.8%
 
15 711 0.7%
 
14 398 0.4%
 
19 246 0.2%
 
Other values (4) 170 0.2%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length2
Mean length1.998525641
Min length1
Scatter

Main_outline
Categorical

MISSING
HIGH CARDINALITY
Distinct count307
Unique (%)0.3%
Missing8322
Missing (%)7.6%
Memory size853.2 KiB
13
 
3810
12
 
3613
393
 
3037
0
 
2813
14
 
2436
Other values (301)
85169
ValueCountFrequency (%) 
13 3810 3.5%
 
12 3613 3.3%
 
393 3037 2.8%
 
0 2813 2.6%
 
14 2436 2.2%
 
392 2290 2.1%
 
60 2264 2.1%
 
432 2037 1.9%
 
420 2008 1.8%
 
54 1927 1.8%
 
Other values (296) 74643 68.4%
 
(Missing) 8322 7.6%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length2.558763736
Min length1
Scatter

Material_Type
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
ZHW1
100819
ZHW3
 
8076
ZUB1
 
305
ValueCountFrequency (%) 
ZHW1 100819 92.3%
 
ZHW3 8076 7.4%
 
ZUB1 305 0.3%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length4
Min length4
Scatter

month_year
Categorical

Distinct count36
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
12/2019
 
3210
07/2018
 
3186
01/2019
 
3150
12/2018
 
3140
08/2018
 
3131
Other values (31)
93383
ValueCountFrequency (%) 
12/2019 3210 2.9%
 
07/2018 3186 2.9%
 
01/2019 3150 2.9%
 
12/2018 3140 2.9%
 
08/2018 3131 2.9%
 
10/2018 3127 2.9%
 
07/2019 3114 2.9%
 
01/2018 3107 2.8%
 
11/2019 3106 2.8%
 
10/2017 3087 2.8%
 
Other values (26) 77842 71.3%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length7
Mean length7
Min length7
Scatter
Distinct count3
Unique (%)< 0.1%
Missing68
Missing (%)0.1%
Memory size853.2 KiB
1
68982
0
40150
(Missing)
 
68
ValueCountFrequency (%) 
1 68982 63.2%
 
0 40150 36.8%
 
(Missing) 68 0.1%
 

Narcotic_/_psychotropic
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count6
Unique (%)< 0.1%
Missing49980
Missing (%)45.8%
Infinite0
Infinite (%)0.0%
Mean0.1018574806
Minimum0.0
Maximum5.0
Zeros55580
Zeros (%)50.9%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4596484669
Coefficient of variation (CV)4.512662834
Kurtosis39.88497193
Mean0.1018574806
Median Absolute Deviation (MAD)0.1911934742
Skewness5.758947642
Sum6032
Variance0.2112767132
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 55580 50.9%
 
1 2058 1.9%
 
2 1002 0.9%
 
3 465 0.4%
 
5 115 0.1%
 
(Missing) 49980 45.8%
 
ValueCountFrequency (%) 
0 55580 50.9%
 
1 2058 1.9%
 
2 1002 0.9%
 
3 465 0.4%
 
5 115 0.1%
 
ValueCountFrequency (%) 
5 115 0.1%
 
3 465 0.4%
 
2 1002 0.9%
 
1 2058 1.9%
 
0 55580 50.9%
 

Plant
Categorical

CONST
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
7350
109200
ValueCountFrequency (%) 
7350 109200 100.0%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length6
Mean length6
Min length6
Scatter

Prediction
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2189
Unique (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74872.72531
Minimum0.0
Maximum4302782.0
Zeros10520
Zeros (%)9.6%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1754
median4390
Q346016
95-th percentile364336
Maximum4302782
Range4302782
Interquartile range (IQR)45262

Descriptive statistics

Standard deviation232804.2894
Coefficient of variation (CV)3.109333718
Kurtosis110.3154294
Mean74872.72531
Median Absolute Deviation (MAD)103632.2149
Skewness8.466796196
Sum8176101604
Variance5.419783715e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 5.5000000e+00 1.6000000e+01 1.7500000e+01 ... 1.1548035e+06 1.2405160e+06 1.2994585e+06 1.5776160e+06 4.3027820e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 10520 9.6%
 
627 193 0.2%
 
465 172 0.2%
 
1466 169 0.2%
 
1294 164 0.2%
 
471 155 0.1%
 
598 154 0.1%
 
3785 144 0.1%
 
6903 144 0.1%
 
4855 144 0.1%
 
Other values (2179) 97241 89.0%
 
ValueCountFrequency (%) 
0 10520 9.6%
 
1 2 < 0.1%
 
3 1 < 0.1%
 
4 2 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
4302782 72 0.1%
 
3093749 72 0.1%
 
2913005 72 0.1%
 
1601720 72 0.1%
 
1553512 72 0.1%
 

PRICE
Real number (ℝ≥0)

SKEWED
Distinct count1485
Unique (%)1.4%
Missing5
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean41.4995585
Minimum0.0
Maximum13106.76
Zeros1
Zeros (%)< 0.1%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.14
Q10.49
median4.84
Q316.45
95-th percentile79.56
Maximum13106.76
Range13106.76
Interquartile range (IQR)15.96

Descriptive statistics

Standard deviation336.7350527
Coefficient of variation (CV)8.11418398
Kurtosis764.5607774
Mean41.4995585
Median Absolute Deviation (MAD)60.93345883
Skewness24.26847489
Sum4531544.29
Variance113390.4957
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.14 1361 1.2%
 
0.16 1058 1.0%
 
0.2 1053 1.0%
 
0.15 1050 1.0%
 
0.33 1042 1.0%
 
0.29 1022 0.9%
 
0.11 1018 0.9%
 
0.51 907 0.8%
 
11.7 891 0.8%
 
0.1 861 0.8%
 
Other values (1474) 98932 90.6%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
0.01 207 0.2%
 
0.03 232 0.2%
 
0.04 180 0.2%
 
0.05 250 0.2%
 
ValueCountFrequency (%) 
13106.76 2 < 0.1%
 
12471.82 35 < 0.1%
 
10466.91 4 < 0.1%
 
9371.7 5 < 0.1%
 
8226.27 3 < 0.1%
 

Price_for_absolute_packaging
Real number (ℝ≥0)

ZEROS
Distinct count1917
Unique (%)1.8%
Missing5
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean310.5627059
Minimum0.0
Maximum49409.36
Zeros1632
Zeros (%)1.5%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile2.2
Q17.6
median16.38
Q371.4
95-th percentile1533
Maximum49409.36
Range49409.36
Interquartile range (IQR)63.8

Descriptive statistics

Standard deviation1367.93726
Coefficient of variation (CV)4.404705505
Kurtosis189.0219014
Mean310.5627059
Median Absolute Deviation (MAD)478.1886963
Skewness11.082327
Sum33911894.67
Variance1871252.348
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 1632 1.5%
 
4.8 1280 1.2%
 
4.2 1095 1.0%
 
15.3 766 0.7%
 
3.3 734 0.7%
 
8.7 717 0.7%
 
6 700 0.6%
 
11.7 638 0.6%
 
5.7 576 0.5%
 
3 573 0.5%
 
Other values (1906) 100484 92.0%
 
ValueCountFrequency (%) 
0 1632 1.5%
 
0.03 63 0.1%
 
0.07 72 0.1%
 
0.09 89 0.1%
 
0.1 215 0.2%
 
ValueCountFrequency (%) 
49409.36 4 < 0.1%
 
41212.53 1 < 0.1%
 
35099.96 7 < 0.1%
 
34752 1 < 0.1%
 
30433.62 1 < 0.1%
 

Pure_OTC
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
0
80726
1
28474
ValueCountFrequency (%) 
0 80726 73.9%
 
1 28474 26.1%
 

Quantity_in_absolute_packaging
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count54
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.10550366
Minimum0.0
Maximum448.0
Zeros1631
Zeros (%)1.5%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q11
median20
Q330
95-th percentile90
Maximum448
Range448
Interquartile range (IQR)29

Descriptive statistics

Standard deviation28.60742204
Coefficient of variation (CV)1.238121551
Kurtosis18.62121675
Mean23.10550366
Median Absolute Deviation (MAD)20.08078614
Skewness2.857420108
Sum2523121
Variance818.3845959
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 207. 217. 335. 424. 448. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 38992 35.7%
 
30 24328 22.3%
 
28 7331 6.7%
 
60 5654 5.2%
 
20 5458 5.0%
 
100 3641 3.3%
 
10 2629 2.4%
 
50 2455 2.2%
 
5 1958 1.8%
 
56 1846 1.7%
 
Other values (44) 14908 13.7%
 
ValueCountFrequency (%) 
0 1631 1.5%
 
1 38992 35.7%
 
2 1255 1.1%
 
3 799 0.7%
 
4 1409 1.3%
 
ValueCountFrequency (%) 
448 1 < 0.1%
 
400 46 < 0.1%
 
270 3 < 0.1%
 
224 3 < 0.1%
 
210 36 < 0.1%
 

Quantity_in_Packaging-Absolute
Real number (ℝ≥0)

MISSING
HIGH CORRELATION
Distinct count55
Unique (%)0.1%
Missing1467
Missing (%)1.3%
Infinite0
Infinite (%)0.0%
Mean23.42013125
Minimum0.0
Maximum448.0
Zeros164
Zeros (%)0.2%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q11
median20
Q330
95-th percentile90
Maximum448
Range448
Interquartile range (IQR)29

Descriptive statistics

Standard deviation28.67333173
Coefficient of variation (CV)1.224302777
Kurtosis18.58325401
Mean23.42013125
Median Absolute Deviation (MAD)20.05709491
Skewness2.850608592
Sum2523121
Variance822.1599527
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 38992 35.7%
 
30 24328 22.3%
 
28 7331 6.7%
 
60 5654 5.2%
 
20 5458 5.0%
 
100 3641 3.3%
 
10 2629 2.4%
 
50 2455 2.2%
 
5 1958 1.8%
 
56 1846 1.7%
 
Other values (44) 13441 12.3%
 
(Missing) 1467 1.3%
 
ValueCountFrequency (%) 
0 164 0.2%
 
1 38992 35.7%
 
2 1255 1.1%
 
3 799 0.7%
 
4 1409 1.3%
 
ValueCountFrequency (%) 
448 1 < 0.1%
 
400 46 < 0.1%
 
270 3 < 0.1%
 
224 3 < 0.1%
 
210 36 < 0.1%
 

Quantity_in_packing-relative
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count89
Unique (%)0.1%
Missing3430
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean25.56672497
Minimum0.0
Maximum1000.0
Zeros73588
Zeros (%)67.4%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q35
95-th percentile125
Maximum1000
Range1000
Interquartile range (IQR)5

Descriptive statistics

Standard deviation83.55999548
Coefficient of variation (CV)3.268310493
Kurtosis40.42127355
Mean25.56672497
Median Absolute Deviation (MAD)40.07850571
Skewness5.583719107
Sum2704192.5
Variance6982.272844
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 73588 67.4%
 
15 3639 3.3%
 
100 2884 2.6%
 
10 2528 2.3%
 
50 2375 2.2%
 
30 2283 2.1%
 
5 1815 1.7%
 
20 1412 1.3%
 
500 1201 1.1%
 
3 967 0.9%
 
Other values (78) 13078 12.0%
 
(Missing) 3430 3.1%
 
ValueCountFrequency (%) 
0 73588 67.4%
 
0.1 14 < 0.1%
 
0.2 20 < 0.1%
 
0.3 44 < 0.1%
 
0.4 182 0.2%
 
ValueCountFrequency (%) 
1000 138 0.1%
 
750 70 0.1%
 
500 1201 1.1%
 
454 61 0.1%
 
450 148 0.1%
 

Safety_Stock
Real number (ℝ≥0)

ZEROS
HIGH CORRELATION
Distinct count2158
Unique (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72244.11013
Minimum0.0
Maximum5242283.0
Zeros11178
Zeros (%)10.2%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q1697
median3116
Q340883
95-th percentile295952
Maximum5242283
Range5242283
Interquartile range (IQR)40186

Descriptive statistics

Standard deviation287676.8711
Coefficient of variation (CV)3.982011413
Kurtosis149.9582291
Mean72244.11013
Median Absolute Deviation (MAD)103397.3879
Skewness10.43211316
Sum7889056826
Variance8.275798214e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 3.5000000e+00 4.5000000e+00 6.5000000e+00 ... 1.7691890e+06 2.0339505e+06 2.9453580e+06 5.1687510e+06 5.2422830e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 11178 10.2%
 
1235 168 0.2%
 
1392 160 0.1%
 
461 149 0.1%
 
736 146 0.1%
 
1796 146 0.1%
 
49325 144 0.1%
 
1635 144 0.1%
 
143220 143 0.1%
 
13625 142 0.1%
 
Other values (2148) 96680 88.5%
 
ValueCountFrequency (%) 
0 11178 10.2%
 
1 24 < 0.1%
 
3 41 < 0.1%
 
4 1 < 0.1%
 
5 22 < 0.1%
 
ValueCountFrequency (%) 
5242283 72 0.1%
 
5095219 72 0.1%
 
3274632 72 0.1%
 
2616084 72 0.1%
 
2266471 72 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing15
Missing (%)< 0.1%
Memory size853.2 KiB
no
92271
yes
 
16914
(Missing)
 
15
ValueCountFrequency (%) 
no 92271 84.5%
 
yes 16914 15.5%
 
(Missing) 15 < 0.1%
 

Send_code_to_Omri
Boolean

CONST
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
1
109200
ValueCountFrequency (%) 
1 109200 100.0%
 

Serving_form
Categorical

MISSING
HIGH CARDINALITY
Distinct count53
Unique (%)< 0.1%
Missing9531
Missing (%)8.7%
Memory size853.2 KiB
TAB
44496
CAP
10089
CR
 
5930
COL
 
3767
CPL
 
3095
Other values (47)
32292
ValueCountFrequency (%) 
TAB 44496 40.7%
 
CAP 10089 9.2%
 
CR 5930 5.4%
 
COL 3767 3.4%
 
CPL 3095 2.8%
 
GEL 2487 2.3%
 
OIN 2386 2.2%
 
SOL 2137 2.0%
 
INJ 1959 1.8%
 
LIQ 1921 1.8%
 
Other values (42) 21402 19.6%
 
(Missing) 9531 8.7%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.03297619
Min length2
Scatter

skucode2
Categorical

HIGH CARDINALITY
Distinct count3281
Unique (%)3.0%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
10000005285
 
72
10000003428
 
72
10000002589
 
72
10000005408
 
72
10000002681
 
72
Other values (3276)
108840
ValueCountFrequency (%) 
10000005285 72 0.1%
 
10000003428 72 0.1%
 
10000002589 72 0.1%
 
10000005408 72 0.1%
 
10000002681 72 0.1%
 
10000004917 72 0.1%
 
10000006948 72 0.1%
 
10000005390 72 0.1%
 
10000006378 72 0.1%
 
10000003484 72 0.1%
 
Other values (3271) 108480 99.3%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length11
Mean length11
Min length11
Scatter

Sourcing_source
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
ארץ
108533
חו"ל
 
667
ValueCountFrequency (%) 
ארץ 108533 99.4%
 
חו"ל 667 0.6%
 

Composition

Contains charsFalse
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length4
Mean length3.006108059
Min length3
Scatter

status
Boolean

CONST
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
1
109200
ValueCountFrequency (%) 
1 109200 100.0%
 

storecode
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
3224
55436
3315
53764
ValueCountFrequency (%) 
3224 55436 50.8%
 
3315 53764 49.2%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length6
Mean length6
Min length6
Scatter

Toxic_item
Categorical

MISSING
Distinct count5
Unique (%)< 0.1%
Missing8884
Missing (%)8.1%
Memory size853.2 KiB
0
99828
1
 
399
3
 
70
2
 
19
ValueCountFrequency (%) 
0 99828 91.4%
 
1 399 0.4%
 
3 70 0.1%
 
2 19 < 0.1%
 
(Missing) 8884 8.1%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length3
Mean length3
Min length3
Scatter

TranQuantity
Real number (ℝ≥0)

Distinct count2101
Unique (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.1847253
Minimum0.0
Maximum12780.0
Zeros546
Zeros (%)0.5%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q13
median20
Q3126
95-th percentile950
Maximum12780
Range12780
Interquartile range (IQR)123

Descriptive statistics

Standard deviation589.8659505
Coefficient of variation (CV)3.006686426
Kurtosis90.78169812
Mean196.1847253
Median Absolute Deviation (MAD)261.9104563
Skewness7.862365333
Sum21423372
Variance347941.8396
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-01 1.5000e+00 2.5000e+00 3.5000e+00 ... 3.8005e+03 4.6850e+03 7.8100e+03 1.0565e+04 1.2780e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 13075 12.0%
 
2 9063 8.3%
 
3 6368 5.8%
 
4 4917 4.5%
 
60 4242 3.9%
 
30 3960 3.6%
 
5 3585 3.3%
 
6 3013 2.8%
 
120 2453 2.2%
 
90 2226 2.0%
 
Other values (2091) 56298 51.6%
 
ValueCountFrequency (%) 
0 546 0.5%
 
1 13075 12.0%
 
2 9063 8.3%
 
3 6368 5.8%
 
4 4917 4.5%
 
ValueCountFrequency (%) 
12780 2 < 0.1%
 
12420 1 < 0.1%
 
12240 1 < 0.1%
 
11460 2 < 0.1%
 
11264 1 < 0.1%
 

type_of_packeging
Categorical

MISSING
Distinct count7
Unique (%)< 0.1%
Missing102708
Missing (%)94.1%
Memory size853.2 KiB
BOX
4341
TUB
932
BOT
926
PKG
 
250
BAG
 
27
ValueCountFrequency (%) 
BOX 4341 4.0%
 
TUB 932 0.9%
 
BOT 926 0.8%
 
PKG 250 0.2%
 
BAG 27 < 0.1%
 
KIT 16 < 0.1%
 
(Missing) 102708 94.1%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.940549451
Min length3
Scatter

U#S#_Patent_Expires
Categorical

MISSING
Distinct count34
Unique (%)< 0.1%
Missing107330
Missing (%)98.3%
Memory size853.2 KiB
01.03.2016
 
162
01.02.2018
 
141
01.10.2020
 
136
01.07.2018
 
130
01.10.2018
 
114
Other values (28)
1187
ValueCountFrequency (%) 
01.03.2016 162 0.1%
 
01.02.2018 141 0.1%
 
01.10.2020 136 0.1%
 
01.07.2018 130 0.1%
 
01.10.2018 114 0.1%
 
01.05.2020 108 0.1%
 
01.03.2017 98 0.1%
 
01.12.2019 87 0.1%
 
01.03.2020 82 0.1%
 
01.06.2018 78 0.1%
 
Other values (23) 734 0.7%
 
(Missing) 107330 98.3%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length4.102747253
Min length4
Scatter

Validity_of_Ministry_of_Health_registration
Categorical

MISSING
HIGH CARDINALITY
Distinct count83
Unique (%)0.1%
Missing103787
Missing (%)95.0%
Memory size853.2 KiB
28.02.2015
 
297
30.04.2015
 
296
30.11.2012
 
291
30.11.2013
 
216
31.01.2007
 
216
Other values (77)
4097
ValueCountFrequency (%) 
28.02.2015 297 0.3%
 
30.04.2015 296 0.3%
 
30.11.2012 291 0.3%
 
30.11.2013 216 0.2%
 
31.01.2007 216 0.2%
 
30.09.2012 170 0.2%
 
31.10.2013 144 0.1%
 
31.03.2015 143 0.1%
 
31.05.2014 138 0.1%
 
31.01.2008 134 0.1%
 
Other values (72) 3368 3.1%
 
(Missing) 103787 95.0%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsTrue

Length

Max length10
Mean length4.297417582
Min length4
Scatter

VENDOR
Categorical

HIGH CARDINALITY
Distinct count155
Unique (%)0.1%
Missing680
Missing (%)0.6%
Memory size853.2 KiB
400059
 
10422
400057
 
5992
408668
 
5839
400095
 
5065
400045
 
4310
Other values (149)
76892
ValueCountFrequency (%) 
400059 10422 9.5%
 
400057 5992 5.5%
 
408668 5839 5.3%
 
400095 5065 4.6%
 
400045 4310 3.9%
 
406885 4250 3.9%
 
407214 3381 3.1%
 
400026 3330 3.0%
 
410232 3310 3.0%
 
400032 2641 2.4%
 
Other values (144) 59980 54.9%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length6
Mean length5.987545788
Min length4
Scatter

volume
Real number (ℝ≥0)

Distinct count1971
Unique (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.851421
Minimum0.0
Maximum9973.0
Zeros64
Zeros (%)0.1%
Memory size853.2 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile3.072
Q16.239
median84
Q3245.784
95-th percentile744
Maximum9973
Range9973
Interquartile range (IQR)239.545

Descriptive statistics

Standard deviation368.6860182
Coefficient of variation (CV)1.872915199
Kurtosis71.22447651
Mean196.851421
Median Absolute Deviation (MAD)206.2782965
Skewness6.116605493
Sum21496175.17
Variance135929.38
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.0000e+00 1.1300e-01 8.4400e-01 9.6000e-01 1.0200e+00 ... 4.2755e+03 4.3515e+03 4.4655e+03 6.4060e+03 9.9730e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3.467 729 0.7%
 
3.5 620 0.6%
 
3.433 541 0.5%
 
3.967 500 0.5%
 
180 479 0.4%
 
7.333 406 0.4%
 
6.5 388 0.4%
 
3.6 384 0.4%
 
8.1 382 0.3%
 
99 372 0.3%
 
Other values (1961) 104399 95.6%
 
ValueCountFrequency (%) 
0 64 0.1%
 
0.226 3 < 0.1%
 
0.778 14 < 0.1%
 
0.91 72 0.1%
 
1.01 14 < 0.1%
 
ValueCountFrequency (%) 
9973 2 < 0.1%
 
8258 13 < 0.1%
 
4554 72 0.1%
 
4377 21 < 0.1%
 
4326 65 0.1%
 

Volume_marker
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size853.2 KiB
LOW
69153
HIGH
40047
ValueCountFrequency (%) 
LOW 69153 63.3%
 
HIGH 40047 36.7%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length4
Mean length3.366730769
Min length3
Scatter

Correlations

Missing values

Sample

First rows

A_group_of_materialsABCAffiliation_GroupAlternative_purchasing_groupATCBasic_unit_of_measureBudget_GroupChemical_/_biologicalchronicConsumer_managementconsumptionconsumption_previous_yearCurrent_consumptionDangerous_material_is_charged_with_a_permitdangerous_substanceDate_of_basket_entryDescription_Alternative_purchasing_groupDescription_outlineEstablishment_date_of_the_itemEuropean_patent_expiresExceptional_policyFor_adultsForm_of_givingGeneral_purposeGENERY_FATHERInventoryInventory_of_consumption_monthsItem_in_health_basketItem_TypeLoading_groupMain_outlineMaterial_Typemonth_yearMust_prescribeNarcotic_/_psychotropicPlantPredictionPRICEPrice_for_absolute_packagingPure_OTCQuantity_in_absolute_packagingQuantity_in_Packaging-AbsoluteQuantity_in_packing-relativeSafety_StockSeasonalitySend_code_to_OmriServing_formskucode2Sourcing_sourcestatusstorecodeToxic_itemTranQuantitytype_of_packegingU#S#_Patent_ExpiresValidity_of_Ministry_of_Health_registrationVENDORvolumeVolume_marker
020140B151180ATC:G03AA09PACB1.00.0ave6369.083062.08491.0NoneNone01.04.2005Oral contraceptiveOral Contra.2006-03-06None0.0None00.0None8811.01.381.00.023171ZHW103/20171.00.07350.07351.09.36196.560.021.021.00.04671.0no1.0TAB10000005719ארץ1.03224.00.011.0BOXNoneNone408668101.000LOW
120190C201054ATC:N06AA04EAB1.01.0ave26450.0306420.034920.0NoneNone01.01.1995Depression-TricyclicAntidepressants2006-03-06None0.0None00.0399900542268550.02.591.00.023420ZHW102/20191.00.07350.025312.00.216.300.030.030.00.031010.0no1.0TAB10000006427ארץ1.03224.00.0120.0NoneNoneNone4000953.733LOW
220270C241351ATC:D08AX08EAE1.00.0ave1321.013544.02220.000Noneחיטוי ידייםcosmetics2016-05-18None0.0NoneNone0.0None875.00.660.00.023392ZHW107/20170.0NaN7350.01231.014.0414.041.01.01.0500.0924.0no1.0GEL10000001880ארץ1.03315.00.02.0NoneNoneNone124207941.000HIGH
310200C72NoneZRP:102US01EADNaN1.0ave0.00.00.0NoneNone01.03.2001NoneNone2006-03-06None0.0NoneNone0.0None0.00.001.00.023NoneZHW311/20181.0NaN7350.00.018.8118.810.01.01.00.00.0no1.0None10000007355ארץ1.03315.0NaN5.0NoneNoneNone400044198.360HIGH
420180C191156ATC:M01AH01EAB1.00.0ave47073.0525410.059330.0NoneNoneNoneNSAIDS-COX 2 inhibtorsRheum. Arth.2006-03-06None0.0None00.0399901065587390.01.860.00.02365ZHW102/20171.00.07350.045373.00.565.600.010.010.00.053858.0no1.0CAP10000006732ארץ1.03224.00.030.0NoneNoneNone40007710.500LOW
520120B131605ATC:C08CA01EAB1.01.0ave0.00.00.0NoneNone01.01.2000Calcium Chanell BlockersCalcium Chanell Blockers2006-09-26None0.0NoneNone0.039990191720.00.001.00.023431ZHW104/20171.00.07350.00.00.142.800.020.020.00.00.0no1.0TAB10000004522ארץ1.03315.00.03220.0NoneNoneNone4000596.300HIGH
610150C72NoneZRP:102UB01EADNaN1.0ave0.00.00.0NoneNone01.03.2001NoneNone2009-06-03None0.0NoneNone0.0None0.00.001.00.023NoneZHW303/20191.0NaN7350.00.032.830.000.00.0NaNNaN0.0no1.0None10000003897ארץ1.03315.0NaN4.0NoneNoneNone400044548.625HIGH
720220C221946ATC:S01XA30PACE1.00.0ave1429.017751.01852.0NoneNoneNoneטיפות ליובש בעינייםArtificial Tears2010-12-27None0.0NoneNone0.0None2952.02.070.00.023128ZHW105/20180.0NaN7350.01490.025.74823.681.032.032.00.91544.0no1.0COL10000003478ארץ1.03315.00.010.0NoneNoneNone400006505.000LOW
820100B111354ATC:A01AB09EAB1.00.0ave3480.039454.04049.0NoneNone01.01.1995טיפול אנטי פטרייתיFungal Inf, Oroph.2006-03-06None0.0None00.0None2510.00.721.00.02383ZHW101/20180.00.07350.03255.013.0613.060.01.01.040.01740.0no1.0GEL10000006295ארץ1.03315.00.04.0NoneNone30.04.2012117469214.000LOW
910100C241429ZRP:A21ADEAENaN0.0ave514.05821.0712.0NoneNoneNoneסד תמיכה לידNone2006-03-06None0.0NoneNone0.0None1282.02.490.00.023NoneZHW305/20180.0NaN7350.0482.029.2529.251.01.01.00.0607.0no1.0None10000006992ארץ1.03224.0NaN2.0NoneNoneNone400123773.000LOW

Last rows

A_group_of_materialsABCAffiliation_GroupAlternative_purchasing_groupATCBasic_unit_of_measureBudget_GroupChemical_/_biologicalchronicConsumer_managementconsumptionconsumption_previous_yearCurrent_consumptionDangerous_material_is_charged_with_a_permitdangerous_substanceDate_of_basket_entryDescription_Alternative_purchasing_groupDescription_outlineEstablishment_date_of_the_itemEuropean_patent_expiresExceptional_policyFor_adultsForm_of_givingGeneral_purposeGENERY_FATHERInventoryInventory_of_consumption_monthsItem_in_health_basketItem_TypeLoading_groupMain_outlineMaterial_Typemonth_yearMust_prescribeNarcotic_/_psychotropicPlantPredictionPRICEPrice_for_absolute_packagingPure_OTCQuantity_in_absolute_packagingQuantity_in_Packaging-AbsoluteQuantity_in_packing-relativeSafety_StockSeasonalitySend_code_to_OmriServing_formskucode2Sourcing_sourcestatusstorecodeToxic_itemTranQuantitytype_of_packegingU#S#_Patent_ExpiresValidity_of_Ministry_of_Health_registrationVENDORvolumeVolume_marker
10919020130C141519ATC:D02AC02EAE1.00.0pred900.09426.01085.0NoneNoneNoneתכשיר רחצה טיפולי - תינוקותDry Skin2010-06-09None0.0NoneNone0.0None1563.01.740.00.025142ZHW106/20180.0NaN7350.0951.030.1430.141.01.01.0500.0760.0yes1.0OIL10000003622ארץ1.03315.00.02.0NoneNoneNone400044985.000HIGH
10919120160A171651ATC:J07AJ52EAB2.00.0pred0.00.00.0NoneNoneNoneDPT Vac.DPT Vac.2011-03-22None1.0NoneNone0.039990021400.00.001.00.01196ZHW101/20171.0NaN7350.00.063.1863.180.01.01.00.50.0no1.0SRG10000003406ארץ1.03315.00.00.0NoneNoneNone400164159.600LOW
10919220240B241401ATC:V06DB15EAB1.00.0ave15000.0196110.019770.0NoneNone01.03.2008מזון ייעודיMetab. Disord.2014-11-04None0.0NoneNone0.0None10860.00.721.00.025107ZHW104/20190.0NaN7350.016335.09.019.011.01.01.0220.08916.0no1.0LIQ10000002408ארץ1.03315.00.060.0NoneNoneNone401882465.100HIGH
10919320180B191076ATC:M04AC01EAB1.01.0ave463260.05311080.0619170.0NoneNone01.01.1995FMFGout2006-03-06None0.0None00.0None338070.00.731.00.02375ZHW101/20171.05.07350.0451081.00.3510.500.030.030.00.0231630.0no1.0TAB10000006474ארץ1.03224.00.01300.0NoneNoneNone4000452.933HIGH
10919420190C201639ATC:N06AX26EAB1.01.0ave8727.095368.012180.000NoneDepressionDepression2016-05-16None0.0NoneNone0.0None19376.02.220.00.02311ZHW111/20191.0NaN7350.08732.04.22118.160.028.028.00.09567.0no1.0TAB10000001890ארץ1.03224.00.028.0NoneNoneNone4019523.429LOW
10919520190C201823ATC:N02AA05EAB1.00.0ave693.08445.0861.0NoneNoneNonePain, Mod/SevPain, Mod/Sev2013-10-14None0.0NoneNone0.0None812.01.170.00.01359ZHW103/20181.01.07350.0731.032.7632.760.01.01.030.0778.0no1.0SYR10000002704ארץ1.03315.00.04.0NoneNoneNone400045168.000LOW
10919620130C141352ATC:D08AG02EAB1.00.0pred6112.074934.06830.0NoneNone01.01.1995חיטוי נגעים בעורAntiseptic2006-03-06None0.0None00.039990211505258.00.861.00.023135ZHW103/20180.00.07350.04924.03.513.510.01.01.015.03447.0yes1.0OIN10000006784ארץ1.03315.00.019.0NoneNoneNone400057138.000HIGH
10919720100C111375ATC:A07BB01EAB1.00.0ave69973.0637500.0116040.0NoneNone01.01.1995טיפול בשלשול וכאב בטןDiarrhea2013-07-01None0.0NoneNone0.0399900254099960.01.431.00.02347ZHW106/20190.0NaN7350.053466.00.295.800.020.020.00.081947.0no1.0TAB10000002793ארץ1.03315.00.0240.0NoneNoneNone4000266.050LOW
10919820150C161694ATC:H02AB07EAB1.01.0ave169400.01913000.0221300.0NoneNone01.01.1995GlucocorticoidGlucocorticoid2006-03-06None0.0None00.0None202900.01.201.00.023141ZHW109/20171.00.07350.0162085.00.1515.000.0100.0100.00.0180690.0no1.0TAB10000006612ארץ1.03224.00.0300.0NoneNoneNone4000570.910LOW
10919920220B221082ATC:S01EA05EAA1.01.0ave9242.0108414.012071.0NoneNone01.10.2005Glaucoma alpha 2 agonistGlaucoma2006-03-06None0.0None00.039990014446359.00.691.00.02315ZHW109/20191.00.07350.08783.016.3816.380.01.01.05.04621.0no1.0COL10000005869ארץ1.03315.00.040.0NoneNoneNone40003682.000LOW